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Abstract. We study the statistics of turnout rates and results of the French elections since 1992. We find 
that the distribution of turnout rates across towns is surprisingly stable over time. The spatial correlation 
of the turnout rates, or of the fraction of winning votes, is found to decay logarithmically with the distance 
between towns. Based on these empirical observations and on the analogy with a two-dimensional random 
diffusion equation, we propose that individual decisions can be rationalised in terms of an underlying "cul- 
tural" field, that locally biases the decision of the population of a given region, on top of an idiosyncratic, 
town-dependent field, with short range correlations. Using symmetry considerations and a set of plausible 
assumptions, we suggest that this cultural field obeys a random diffusion equation. 
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Making decisions is an everyday necessity. In many cases. 
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these decisions are of binary nature: to buy or not to buy 



a product or a stock, to get vaccinated or not, to partici- 
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pate to a vote or not. Elections are in fact themselves often strict to binary choice situations, i.e, either the turnout 
binary, like referendums, or second round of presidential rates for each election, or the results of yes/no referen- 
elections, etc. The final decision of each individual is the dums or second round of presidential elections. We anal- 
result of many factors: individual propensities, common yse in details the statistics of these outcomes, with special 
factors (such as prices, reliability of the vaccine, impor- focus on the dependence of the results on the size of the 
tance of the election, etc.), and, in many cases, the deci- cities, and on the spatial correlations between the different 
sions of others play a major role as well. Whether wc like results. While there might be some indications of direct 
it or not, imitation is deeply rooted in living species as a imitation effects within towns, the structure of inter-town 
strategy for survival. We, as humans, are influenced by our correlations strongly suggests the existence of what we 
kindred both at a primitive level (fear of being excluded propose to call a 'cultural' field, that evolves in time ac- 
from the group) and at a rational level (others may pos- cording to a noisy diffusion equation. This cultural field 
sess some information unknown to us). Imitation can lead encodes local biases in intentions, convictions or propensi- 
to collective eff'ects like trends, fashions and bubbles that ties on a given subject, for example to vote or not to vote, 
would be difficult to understand if we were insensitive to or to vote left or right, to respect or not speed limitations, 
the behaviour of others. These imitation induced opinion etc. etc. These (subject specific) cultural fields transcend 
shifts can be beneficial for a society as a whole (as in the individuals while being shaped, shared, transported and 
case of vaccination), but can also be detrimental and lead transformed by them. Although the existence of such cul- 
to major catastrophes (crowd panic, financial crashes, eco- tural fields has been anticipated by sociologists, political 
nomic crises, rise of extremist ideologies, etc.). Develop- scientists and geographers (see [11] for an early insight, 
ing reliable models for these collective effects is therefore and [12,13] for more recent discussions), we believe that 
of primary importance, see [1-8]. This requires, in partic- our empirical results provide the first quantitative evidence 
ular, to garner quantitative empirical information about for such a concept, and lay forward the possibility of a pre- 
thc imitation processes that may induce strong distortions cise modelisation of its spatio-temporal evolution. Let us 
in the final outcome (see for example [9] for a precise ex- emphasise an important conceptual point: these cultural 
perimental set up to measure these effects, and [10] for a fields should exist independently of any election, or any 
theoretical framework.) other occasions where a decision has to be made. These 

events provide an instantaneous snapshot of the opinion 

In order to study the nature of these behavioural corre- 

or of the behaviour of individuals, which are in part infiu- 

lations, we have studied the space-resolved (town by town) 

enced by these fields, in a way that we will quantify below. 

results of the French elections since 1992. To keep the 
model and the interpretation of the results simple, we re- 
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The cultural field has a dynamics of its own, that we will 2 Empirical regularities and spatial 

model and elaborate on in section 4 below. , i^. ■ ^ 
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Most of the empirical electoral studies previously re- 
ported in the physics literature deal with proportional vot- 
ing from multiple choice lists, and investigate the distri- 
bution of votes; as in Brazil [14-17], in Brazil and In- 
dia [18], in India and Canada [19], in Mexico [20,21], in 
Indonesia [22]. A universal behaviour was reported in [23]. 
Statistical results of elections for the city mayor are stud- 
ied in [24], the typology of Russian elections in [25], cor- 
relations between electoral results and party members in 
Germany in [26] , and statistics of votes per cabin for three 
Mexican elections in [27]. Majority and Media effects were 
investigated for various countries in [28] . Lastly, the spread 
of Green Party in several states in the USA is analysed by 
means of epidemiological models in [29], and data from 
a Finland election is confronted to a Transient Opinion 
Model [30]. See also [31,32] for Political Science studies of 
electoral participation. 

The specific feature of the present work is the quan- 
titative analysis of the spatial correlations of the voting 
patterns. We will first describe several striking empirical 
regularities in the French vote statistics (that we believe 
are not restricted to French elections). We then turn to 
simple models that help putting these findings in context, 
and explain why the idea of a diffusive cultural field, which 
is the central proposal of this work, naturally accounts for 
some of our findings. 



We have analysed the turnout rate of all French elec- 
tions [33] with national choices since 1992 (13 events). 
A subset of 6 elections offered a binary choice to voters: 3 
referendums and 3 second round of presidential elections 
(See Table 1 for details and summary statistics). The na- 
tional results are broken down into ~ 36, 000 local results, 
corresponding to communes (towns), of various popula- 
tion sizes. For each voter i, we define 5, = 1 to correspond 
to participation to the vote or belonging to the majority 
vote, whereas Si — corresponds to not participating, or 
belonging to the minority vote. For a town a, the total 
number of potential voters is Na, the total number of vot- 
ers is Va = '^i' the turnout rate is Pa = Va/Na and 
the total number of winning votes is Wa- We found con- 
venient to work with logarithmic rates for participation or 
winning votes: = ln{Va/No: — Va) = ln(p„/l — Pa) and 
Pa = ln{Wa/Va — Wa)- Each commune is characterised by 
the spatial coordinates [34] of its mairie (town-hall), Ra- 
The distance between two communes, raff, is defined as 
rafi = \Ra — Rp\ (even if the presence of - say - moun- 
tains or rivers in between would make the actual travelling 
distance much longer). 

The issues at stake in all these elections are clearly 
very different, and so it is not a priori obvious that any- 
thing universal (across different elections) can be found in 

the statistics of votes. However, to our surprise, we found 
a number of empirical regularities that we now detail, fo- 
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Table 1. Summary statistics of the elections studied in this paper. R.: referendum (1992: Maastricht treaty, 2000: reduction of 
the presidential mandate to 5 years, 2005: European constitutional treaty); E.: European parliament election; P.l: first round 

of a presidential election; P.2: second round of a presidential election. Moan value (mean), standard-deviation (sd), skcwncss 
(skew) and kurtosis (kurt) of logarithmic turnout rates Ta (left), and of logarithmic winning vote rates Pa (right). Data is for 
metropolitan Prance only. 



election 


kind 


V V 


mean 


sd 


skew 


kurt 


V w 


mean 


sd 


skew 


kurt 


1992-b 


R. 


0.713 


1.13 


0.355 


1.05 


5.48 


0.508 


-0.164 


0.447 


-0.159 


2.48 


1994-m 


E. 


0.539 


0.358 


0.398 


0.837 


9.23 












1995-m 


P.l 


0.795 


1.60 


0.375 


0.928 


5.37 












1995-b 


P.2 


0.805 


1.72 


0.398 


1.35 


5.54 


0.525 


0.187 


0.524 


0.357 


2.71 


1999-m 


E. 


0.478 


0.146 


0.392 


1.15 


7.50 












2000-b 


R. 


0.308 


-0.626 


0.377 


0.858 


8.27 


0.729 


0.874 


0.498 


-0.116 


2.77 


2002-m 


P.l 


0.729 


1.24 


0.347 


1.25 


9.46 












2002-b 


P.2 


0.810 


1.67 


0.367 


1.26 


6.40 


0.820 


1.48 


0.521 


0.776 


2.26 


2004-m 


E. 


0.434 


-0.095 


0.366 


1.45 


9.82 












2005-b 


R. 


0.711 


1.13 


0.351 


1.58 


12.0 


0.550 


0.377 


0.443 


-0.021 


1.37 


2007-m 


P.l 


0.854 


1.98 


0.396 


1.06 


8.02 












2007-b 


P.2 


0.853 


1.99 


0.394 


1.22 


5.28 


0.533 


0.257 


0.487 


0.174 


2.31 


2009-m 


E. 


0.414 


-0.147 


0.360 


1.35 


8.28 













cussing first on turnout rates where these regularities are 
most robust. Very similar results are also found for win- 
ning votes - see below. 

The simplest quantity to look at is the pdf of the loga- 
rithmic turnout rate r over different communes, i.e. what 
is the probability P(r)dT that a given commune, irrespec- 
tive of its size, has a turnout rate r to within dr. Al- 
though the average turnout rate m = (r) varies quite 
substantially between elections (see Table 1), the shape 
of the distribution of f = r — m is remarkably constant 



- even without rescaling by the root mean square a, see 
Fig. 1. The first three cumulants of P(r) are, within error 
bars, the same for all elections (see Table 1). In fact, a 
Kolmogorov-Smirnov test where one only allows for a rel- 
ative shift of the distributions does not allow one to reject 
the hypothesis that P(j/a) is indeed the same for all elec- 
tions (except perhaps 2009 which gives marginal results). 
Note that the distribution of r is clearly non Gaussian, 
with significant positive skewness and kurtosis. 
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■ i995-m 
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• i999-m 
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2005-b 
2007-m 
2007-b 
2009-m 




T-<T> 

Fig. 1. Distribution over communes of P(r — (r)) for each 
election, where r is the logarithmic turnout rate and (r) its 
average over all communes. The suffix -b means the election 
is of binary nature, while -m means multiple choice elections. 
The inset shows the distributions of direct turnout rates, p, for 
each election. 

The statistics of r does in fact depend on the commune- 
size N. We find that both the mean and variance of the 
conditional distribution P{t\N) decrease with N ~ see 
Fig. 2; in particular, small communes have a larger av- 
erage turnout rate (this explains the positive skewness of 
P{t) noted above [35]), but also a larger dispersion around 
the average. This is of course expected for a simple bino- 
mial process, which predicts that cr^(A^) — l/Np{l — p) + 
17^, where p is the (iV-dependent) average turnout rate, 
whereas describes the 'true' variance of the turnout 
rate. The simplest assumption is that is iV-independent, 
in which case the observed variance varies significantly 
faster (by a factor 2 or so) than the simple binomial pre- 
diction that assumes independent voters, see Fig. 2, inset. 
A possible interpretation is that the votes of different in- 
dividuals are not independent within the same commune, 



0,2 0,3 0,4 

< l/(Np(l-p))> 



Fig. 2. Mean (main figure) and variance (inset) of the con- 
ditional distribution P{t\N). These quantities are obtained as 
averages over bins with one thousand communes of size ~ A''. 



leading to an effectively smaller value of N in the bino- 
mial. One can for example assume that within families, 
or groups of close friends, the decision to vote or not to 
vote is exactly the same. If the size distribution of these 
groups of "clones" is Q{s), then it is easy to show that for 
N large enough, these correlations in decision amount to 
replacing N by N{s)q/{s'^)q < N. An explicit example 
is Q{s) = (1 — z)z'^^^ for s = 1,2,.. ., which leads to an 
effective value of N reduced by a factor (1 — z)/(l + z). 
In order to account for a factor of 2 in the variation of cr^ 
with N, we therefore need to choose z « 1/3. Since z is 
the probability that the group is larger than s = 1, this 
looks quite large. In fact, as we will discuss below, there 
is an alternative interpretation of the excess variance that 
does not rely on direct imitation. 

Let us now turn to the spatial correlations between 
turnout rates, measured by the following correlation func- 
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tion: curvature. The value of the cut-off distance L is similar 

((tq — ma)(r« — ma))! „ 
Crir) = , (1) to one reported above, whereas the slope A tends to 



((t„ - ma)2) 

where nia is the average of r over all communes with size 
in the same "bin" as that of a, so as to remove system- 
atic spatial correlations between town sizes. The central 
result of our empirical study is that for all elections, Cx(r) 
is long-range correlated. It is found to decay as the loga- 
rithm of the distance (see Fig. 3, left): for < r < L, 
C'rir) = -X^ln{r/L), and C^(r > L) « 0. Whereas the 
logarithmic slope depends on the election and varies 
by a factor at most 2 (between 1999 and 2007) the cut- 
off distance L is, remarkably similar for all elections, with 
L « 300 km. In order to visualise more directly the corre- 
lations between neighbouring towns, we have also studied 
the relation between and the average of rp over the 16 



be larger, except in the 2000 election (see [35] for more 
details, and also for more statistical regularities in these 
elections.) 

3 Theoretical insights and threshold models 

The long-range, logarithmic dependence of the correlation 
functions Cr(r) and Cp{r) is the most striking finding of 
our study, in particular because it is strongly reminiscent 
of the behaviour of the correlation function of a free diffu- 
sion field in two dimensions. More precisely, let <j){R, t) be 
a two-dimensional field that obeys the following stochastic 
dynamical equation: 

^'''^^'^^ --DAcj^{R,t)+v{R,t), (2) 



/3 towns closest to a. We find a very good linear regression dt 

between the two over the whole range of r's, with a slope ^]^^^^ ^ ^i^^ two-dimensional Laplacian, D is & diffusion 

equal to unity (not shown, see [35]). constant and rj a Langevin noise with zero mean, variance 

The same general picture holds for the statistics of ^2 ^^^^ short-range correlations both in time and in space, 

winning votes, except that: (a) the distribution of the log- jt is well known that the equal-time correlation of (p is (in 

arithmic winning rate P{p) is much less universal across equilibrium) given by: 



elections than P{t). We have noticed in particular that 



r 



^^(^)= '"u(n\2\" --^'In-, 4«r«L, (3) 

the skewness of P{p) varies significantly between elections, / ^ 

and changes sign between presidential elections, where it where £c is a short scale cut-off (for example the corre- 

is positive, and referendums, where it is negative. Con- lation length of the noise rj) and L is the linear size of 

trarily to {t)\n, {p)\n does not show any systematic pat- the system. The behaviour of C^(r) for r < ic and the 

tern, whereas crp{N) is again significantly larger than the logarithmic slope A"^ depend on short scale details of the 

simple binomial prediction; (b) the spatial correlation of model, but not on the the diffusion constant D. The time 

winning votes Cp is also logarithmic with r (see Fig. 3, to reach equilibrium, beyond which the above result holds, 

right), although the 2002 result shows a more pronounced is Tgg = L'^/D. The logarithmic behaviour is a hallmark 
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of the two-dimensional nature of the problem. The strik- or by the intentions ^pj of others (see e.g. [1,4,5,7-9,42]). 
ing similarity between this prediction and our empirical For example, the decision to go see a movie, or to carry- 
findings is the motivation of the theoretical analysis that ing on clapping at the end of concerts or shows, may well 
we present now. depend on what others have done in the past or are cur- 

There is quite a large literature on binary decision I'ently doing. In the first example, the number of people 
models (see [1,36] for classical references, and [5,4,7,8, leaving already seen the movie feeds back on the motiva- 
37-40] for more recent contributions), although the spa- ^^^^ of t^^ose who have not yet seen it, through box-office 
tial correlations that we want to include appear to be new. charts or word of mouth. In the second example [7], the 
1 It is natural to think about these situations in terms of amplitude of clapping generated by the rest of the crowd 
thresholds [1]: although the decision is binary, the process can be directly perceived and is an incentive to continue 
leading to the final choice is in fact continuous and reflects ^^e applause. But in other situations where the decision is 
the individual motivations, propensities or utilities, that taken simultaneously and the final result is only known a 
we will call the intention ^pu where i labels the individu- posteriori (like in elections), this direct imitation mecha- 
als. When exceeds a certain threshold the decision nism cannot be present - although of course the influence 
S is one way - say 5 = 1; when <p is below this thresh- of intentions is possible, and will be expUcitly included in 
old, 5 = 0. The process ip is in general time dependent model below. 

because individuals are influenced by a variety of factors: The above discussion suggest a general decomposition 
what they read, what they hear, what they see contribute of the individual intention field into an idiosyncratic 
to the way they understand a given situation and react to Part, a 'cultural' part and an imitation part. More for- 
it. Some of these influences are idiosyncratic, i.e. unique mally, for an individual i living in the vicinity of R, the 
to each individual (for example, one may be iU on an elec- intention at time t is written as: ^ 

tion day, meaning that the propensity to go out and vote ^ . ^ ^ . + ^^^^ + ^ j.. S^{t-l); (4) 

is very low), while others are common to people living in 

where ei{t) is the instantaneous contribution to the in- 

the same area (for example, the quality of public goods 

tention that is specific to i, and (j){R,t) is an average of 

in a given town, or the unemployment rate, etc.). Finally, 

the intentions of the fellow denizens expressed in a recent 

an important influencing factor is the decision Sj made 

past. This average can be seen as a space dependent 'cul- 

by others, if it is known before making one's own mind, 

tural' fleld which encodes all the local, stable features that 



1 We are aware of a "spatial theory of political choices" (see influence the final decision. In essence, this component 
[41]), but this is a misnomer; 'spatial' in that case refers to a ^ Memory effects, or more complicated time dependent ef- 
distance in the abstract space of convictions. fects can be implemented along the lines of [43,7]. 
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Fig. 3. Spatial correlations Crir) of logarithmic turnout rates (left) and Cp{r) of winning votes (right), showing a logarithmic 
dependence on r with a cut-off distance L ~ 300 km. The average logarithmic slope is found to be ~ 0.065 ±0.01 for turnout 
rates, and ~ 0.11 ± 0.02 for winning votes (removing the 2000 referendum on the reform of the presidential mandate, considered 
to be a very technical subject for which the turnout rate was exceptionally low, see Table 1.) 



should be relatively smooth over both space and time. The 
last term describe the influence of the decision of others, 
with couplings Jy that measure the strength with which 
the decision/intention of individual j influences i. J > 
means conformity of choices, whereas J < encodes dis- 
sent. Many situations (like the movie and the clapping 
examples given above) are described by a mean-field cou- 
pling to the average decision of others: = Jq/N, Vi ^ j, 
or to the average intention of others. Finally, the decision 
rule is S^{t) = 0{(p^{t) - <Pc), with e{x > 0) 1 and 
0{x < 0) = 0. ^ Note that within such a threshold model, 
the intention field is defined up to an arbitrary scale and 
shift. There is no physical unit of intention, nor any par- 



The process leading from (p to S is assumed to be deter- 
ministic and instantaneous when the final decision is taken. 
But in fact, one could add an extra source of randomness by 
assuming that the probability to choose 5=1 grows as a cer- 
tain smooth function of 99 — <f c without changing the essence 



ticular meaning to 1^9 = 0: only differences of intentions 
can matter in the evolution of the (pi's. 

Note that without the 'cultural field' 4>{R, t) and with 
a random static idiosyncratic term e^, Eq. (4) boils down 
to the Random Field Ising Model [44,45], with first appli- 
cations to social dynamics appearing in [46,47], see also 
[7,8]. For early studies and/or critical refiections in socio- 
physics [48], see e.g. [49-54]. 

As noted above the intra- commune correlations be- 
tween votes may be due to direct imitations between mem- 
bers of the same family (between which intentions are of- 
ten shared). But the long-ranged spatial correlations can- 
not be due to the imitation of decision term. One rea- 
son is, as noted above, that elections are not situations 
where the actual decision of others can matter, since it 



of the discussion to follow. In fact, this randomisation can be 
re-absorbed in an appropriate change of P(e). 
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is known too late. Interestingly, however, the data itself ing discussion particularly transparent, i.e.: 

strongly rejects a model where the field (j){R^ t) is short- \ 

V>{x) = —, (6) 

ranged correlated, while assigning the spatial correlations 



1 + e" 



of r to a coupling term between nearby communes, '^l^ere n is the average of the e and the width of P{e) 

The reason this model cannot be made to work is the ^^^^ assume, for simpUcity, to be constant in space 

following: for long-ranged correlations to emerge, the cou- ^nd in time, and that we set equal to unity in the fol- 

pling J must be such that the system is close to its critical lowing. The average /j, may itself depend both on space 

point Jc, beyond which imitation is so strong that the so- ^nd time, and can be seen as an extra, commune specific 

lution of the coupled equations giving the {Si} becomes spatial noise that adds up to the smooth cultural field 

multi-valued. But when this is the case, the corresponding " In fact, we will subtract from 4>{R) any short-range 

distribution of turnout rates becomes very wide, or even correlated component that we assign to ii(R), such that 

bimodal, and negatively skewed in a way that is incom- by definition, the correlation function of the </> field C^(r) 

patible with the unimodal, positively skewed and rather ^^nds smoothly towards 1 when r ^ 0+. 

narrow distribution observed empirically (see Fig. 1). A With this particular logistic choice above, one finds: 
detailed discussion of the inadequacy of this model in the 



r„=ln(^) 



case of elections is provided in [35]. The long-ranged cor- 
relations of r should, we believe, be sought in the spatial 

structure of the cultural field <^{R, t). Up to a shift and a noise component that vanishes when 

Na — )• oo, Ta and (f>{Ra) are now the very same object. 

Prom now on, we therefore neglect the direct imitation p^j. ^^Yict choices of the distribution of e, such a strict 

component in Eq. (4) above. We introduce the cumulative identification is not warranted, but we expect that both 

distribution of the instantaneous, idiosyncratic component object share similar statistical properties, 

e: V>{x) = /~ dx'P{e). Calling tt^ = V> (^c - (t>{Ra)) Within the above identification, the variance of t^, is 

the theoretical turnout rate of commune a, the realized given by- 
turnout rate is given by: 



1 



-)^al[l + A], (8) 



'Np{l-p)' 

where the ratio of variances A is introduced for later con- 



jVc, ^ * ~ " y Na { a ^ ) venience. The variance of the logarithmic turnout rate is 

(5) * We actually expect /x to be different in different neighbour- 
where ^ a standardised Gaussian noise. We will assume a hoods of the same city, reflecting socio-professional or ethnic 
logistic distribution for P(e), which will make the follow- intra- communes variability. 
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therefore the result of three effects: (1) the fluctuations might in fact not be warranted). Interestingly, we have 

of the smooth cultural field, cr^. This quantity is not at- found that the following relation accounts very well for 

tached to a particular commune and therefore cannot de- the data: 

pend on N; (2) the fiuctuations of the average intentions / i \ 

{COYr{t,t))t « B{COYrit,t'))t^t' + ( JVpH - p) / ' ^^^^ 

in a given commune, cr^, that may well depend on A'' (a ^ ^ ^ ' 

naive guess would be as AT"!); and (3) the binomial noise where B w 1.5. The remarkable point of this analysis is 

effect, that scales as N'^. So the extra noise found in the that the coefficient of the binomial contribution is found 

data (see Fig. 2), that we interpreted above as a sign of ^ be exactly unity, meaning that within this interpre- 

intva.- commune herding, may in fact reveal the contribu- tation one does not need to invoke the presence of intra- 

tion of (7^ . communes "clones" , or more precisely that the probability 

A way to distinguish the two interpretations is to com- ^ ^erds is too small to be detectable using our data set 

pute the covariance of the t„ for different elections as a ' ^he coefficient B simply accounts for the average decay 

function of N, and compare it to the variance of t„, plot- °f ^he correlation of and /x as a function of time, and its 

ted in Fig. 2. Since the binomial noise is uncorrelated from ^^1"^ compatible with the results shown in Fig. 5 below, 
election to election, its contribution must drop out from 

the covariance of r for different elections, averaged over 4 The random difTusion equation 

all communes: 

We will now argue why it is natural to expect that the 

Covr{t, t') = {Ta{t)Ta{t'))c Cultural field ^{Ra,t) should obey a noisy diffusion equa- 

= {ct>ait)Mt'))c + {Mt)Mt'))c, {t + t'), (9) ^^'"^ *° with an extra global, time dependent 

driving term. Although (j){Ra,t) is an rather abstract ob- 

where the index 'c' means that we take a cumulant average ^^^^^ existence of which we postulate, its time evolu- 

over all pairs of elections and all communes. If the bino- ^^^^ ^^^^^^^ ^^^^^^^ ^ ^^^^^0^^ ^^^^ ^^^^ describes events 

mial noise was the only source of the N dependence of ct^, ^j^^^ ^^^^^^^ ^^^-^ ^j^^^ ^^^^^ contribute 

the above covariance of the rs at different times should be changing the overall mood of a given city, such as the 

independent of N . This is not what the data shows. There ^^^^^^^ ^^^^ ^ factory or of a military base, important 

is indeed a residual TV-dependence that must be ascribed ^j^^j^ggg population, or a particularly charismatic local 

to the statistics of the average of idiosyncratic intentions, ig^der, for example. This is captured by the term tj{R, t), 

IJ,{R) (although we have no interpretation for the ~ iV-3/2 s ^^^^ leaving the coefficient B' in front of the binomial 

dependence that we observe. The assumption that the dis- contribution free, a regression analysis leads to B' 0.7 < 1, 

persion of individual biases, ae, is commwne-independent whereas the presence of "clones" would require B' > 1. 
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which we imagine to be correlated in space over some sparsely populated areas (because of the larger distance 
length scale Ic comparable to the typical inter- commune between neighbouring towns). 

distance, and over a time of, plausibly, several months. Our final equation, that respects all the symmetries of 

The Laplacian term describes the fact that people them- the model ^, is therefore (in the continuum limit): 
selves move around and carry with them some compo- FiMTt i\ 

= V • [D{R)Vct>{R, t)\ + n{R, t) + v{R)F{t), 

nents of the local cultural specificity (l){Ra,t)- This can 
be cither by actually moving to a nearby city, or by just 

where F{t) represents the public information, for exam- 
visiting or interacting with acquaintances from the neigh- 

ple the subject and importance of the election, national 

bouring cities. The all year round exchange of ideas, infor- 

TV programs, etc. In principle, different communes may 

mations and experiences must lead to a local propagation 

react differently to the same public information, leading 

of (j){Ra,t). Since only difference of intentions matter, the 

to space dependent 'reactivity' z^(i?), but we will neglect 

evolution of the cultural field at -R^ can only depend on 

these spatial fluctuations in the following. ^ Similarly, we 

the differences (^(i?^, t) — (p{Ra,t); furthermore, the model 

assume that the variance and higher cumulants of rj are 

must be invariant under a change of scale of (p. In order to 

homogeneous and independent of the size of the commune, 

satisfy these constraints, the most general term describing 

but this assumption could be relaxed if needed. Because 

the evolution of (t){Ra , t) due to the surrounding influences 

any average trend is described by F{t), the noise i]{R, t) 

takes the following, linear form: 

is of zero mean. The average trend can be accounted for 
'-^^[^^=^r^mR,,t)-m.M, (11) by a uniform shift of 0: 
where rap{raff) > is a symmetric influence matrix, that 

(j){R, t) (t){R, t)-v I dt'F{t'). (13) 
we assume to decrease over a distance corresponding to ■'O 

. , , ., 1. , , „.,..,, _ , ^ Note that 6 —6 is another symmetry of the model, 

typical daily displacements of individuals, say 10 km or 

corresponding to the fact that the propensity to do something 

so. The above equation means that through human inter- 

is related to minus the propensity not to do the same thing, 

actions, the cultural differences between nearby cities tend 

since 0{ifi — <?c) = 1 — 0(^c — 'fi)- The scale invariance and 

to narrow. As is well known, the continuum limit of the 

the shift symmetry of the threshold model in fact only allow 

right hand side of Eq. (11) reads V ■[D{R)V 4>{R,t)\, with 

higher order linear derivatives to appear, such as V*0, etc. 

D{Ra) = \ ''"a^^aff is & (space dependent) measure of 

These terms do indeed appear in a gradient expansion of Eq. 

the speed at which the cultural field diffuses. This spatial 

(11) above. 

dependence is a priori expected: for example, D{R) should Short-ranged correlated noise in z/(J?) does not affect the 
presumably be smaller in mountain regions (because of logarithmic behaviour of C^(r) provided F{t) has a finite cor- 
the difficulty to travel from one valley to the next) or in relation time. 



12 C. Borghesi, J.-P. Bouchaud: Spatial correlations in vote statistics: a diffusive field model for decision-making 

This explains why the average value of (j), and therefore larger than the correlation length of D{R), the effective 

of the turnout rate r, can change substantially over time, equation becomes identical to Eq. (2) with an effective 

while the fluctuations of and its spatial correlations re- diffusion constant D that can be computed using e.g. per- 

main essentially stable - as found empirically, see Figs. 1 turbation theory, or effective medium approximations [55]. 

and 3. What is of interest for our discussion is that the logarith- 

When D{R) = D is uniform, we recover Eq. (2) and mic dependence of C^{r), Eq. (3), is still valid on large 

the logarithmic decay of the correlation function Cr{r) (x enough scales. 
C,p{r) follows immediately, since the contribution of the 



idiosyncratic /U field and the binomial noise, both short 
ranged in space, rapidly disappear when r > 0. Since we 

have removed from (j) its short range spatial contribution, 
the coefficient A"^ appearing in Eq. (3) is given by l/lnj-. 
Therefore, 



In order to test these ideas more quantitatively, we 
have simulated the model using Eq. (11) with the exact lo- 
cations of all french communes, 

spatially inhomogeneities are therefore treated exactly. We 
show the resulting spatial correlations of in Fig. 4- 

left, for different values of £c, in the range [1.5,7.5] km. 



(J -J. Q\ — I pa X — (14) 

'^^11 \-\- A A ln|^' comparable with the inter-town average distance. We 
where A is the ratio of the variance of the idiosyncratic and see that C^(r) is indeed approximately logarithmic and 
binomial noise to that of the cultural field, defined in Eq. looks actually very similar to the empirical curves. Note 
(8) above. From Fig. 3, one sees that 1/(1 + A) ^ 0.2 — 0.4 in particular that while the small r value of C^{r) and 
for turnout rates: the cultural field explains roughly a the logarithmic slope significantly depends on i^-, as 
third of the variance of the local results. This ratio in- expected, the point r at which C^{r) reaches zero is ap- 
creases to one half for winning votes, except for the 2000 proximately constant, and, remarkably, very close to its 
referendum (see Fig. 3, right plot), meaning that the role empirical counterpart (L k, 300 km, on the order of the 
of the cultural field is arguably stronger for polarized po- size of the system). It is tempting to associate the apparent 
litical decisions than it is for the question of participating dependence of on the election to a temporal evolution 
or not to a vote. of Ic- However, this is not correct: our numerical simu- 
When -D(-R) is non uniform, the equilibrium correla- lations show that even when the system is equilibrated 
tion of (/>(-R) cannot be computed in general. However, (i.e. when t ^ L'^/D), the measured Cfj,{r) for a fixed 
drawing analogies from physics [55], we know that this in- value of Ic fluctuates quite strongly between different re- 
homogeneous diffusion equation can be, under rather gen- alizations of the noise ?7, see Fig. 4-right. The range over 
eral hypothesis on the statistics of (/I), "homogeneized" . which A^ varies in the simulation is very similar to what 
This means that coarse graining on sufficiently large scales, is found empirically. Therefore, the empirical results are 
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1 =4.5 km 
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Fig. 4. Numerical simulation of the diffusive field model, with the true positions of the French communes, and an interaction 
between nearby communes decaying as exp(— r/^c)- Left, we show the correlation C4,{R) as a function of Ic, after transient 
regime and averaged over many realizations of the 'noise' rj. The distance R is here measured in units of the average inter- 
commune distance (that depends on the departement. R — 100 corresponds to 270 km, precisely the scale at which the empircal 
correlations reach zero. Right, ten realizations of C',j,{R) for £c = 4.5 km, still after transient regime, as a function of the scaled 
distance R. In the inset, value of the nearest neighbour correlation C{R = 1) as a function of 'time', showing strong fiuctuations. 



fully compatible with a fixed value of £c- Matching the 
average empirical slope to the numerical data and taking 
into account the value of A suggests that £c must be very 
small, of the order of a few kms, fully compatible with our 
interpretation. We have checked in our numerical simula- 
tions that our diffusive field model indeed reproduces the 
strong linear correlation between Tq, and the average value 
of T/j over the neighbouring towns, with unit slope. This is 
of course expected and is the essence of the model, since 
the Laplacian coupling in Eq. (2) says precisely that on av- 
erage the Laplacian of (j) is zero, i.e. that (pa is on average 
equal to J2p£n{a) '^0' where n{a) is the neighbourhood of 
a. This empirical finding can in fact be seen as a direct, 
microscopic motivation for postulating Eq. (11) above. 



In order to estimate the diffusion of intentions D, one 
can assume that a substantial fraction of the population of 
a given town experiences some interaction with neighbour- 
ing towns after several weeks. For opinions to get closer, 
this time is probably more on the scale of months. Taking 
the relevant inter-town distance to be ^ 10 km leads to a 
diffusion constant of the order oi D ^ 100 km^ /month or 
less (ways of obtaining more precise empirical estimates of 
this quantity would be important here). The correspond- 
ing equilibration time T^q over L = 300 km is therefore 
quite long, a century or more. But this is compatible with 
the fact that the voting habits of the different regions seem 
to be extremely persistent in history - which is in itself a 
strong argument for the existence of a cultural field that 
keeps the memory independently of the presence of partic- 
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ular individuals. This justifies the apparent correlation be- 
tween geography and votes discussed long ago by Siegfried 
[11] {Le granite vote a droite, le calcaire vote a gauche). In- 
terestingly, a more precise prediction of our model is that 
the temporal autocorrelation of (j) for a given R should 
behave as: 
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where the slope is exactly one half of the slope of Ctjjir) 
versus — Inr, and Tc the correlation time of the noise 77. 
We have tested this prediction directly on data, by study- 
ing the autocorrelation of the fluctuations of the partici- 
pation rate for a given commune across different elections. 



It. - tl (in weelcs) 



Fig. 5. Temporal correlation Crit) of turnout rates r. Here, 
t — \ti — tj\ where ti and tj are the election dates of elections. 
The red line shows the expected Crit) ~ — iA^lii(t) where 
~ 0.065 is the averaged slope over all elections of the spatial 
correlations Cr{r) ~ — ln(r). The blue line shows the linear 
regression of Crit) as a function of ln(t), here with a slope of 
« -0.066. 



averaged over all communes. Although the data is noisy, 

see Fig. 5, the resuhs are indeed compatible with a log- In any case, in the absence of more information on the 

arithmic decay in time with a slope that has the correct dynamics of the ^ field, we find the overall agreement be- 

order of magnitude. From the spatial correlations of r we tween the model and the behaviour of these temporal au- 

find w 0.065 (averaged over all elections - see Fig. tocorrelations satisfying. 

3), whereas the linear regression of Cr{t) as a function 

of Ini gives as slope of ~ —0.066. The discrepancy with _ _ , . 

5 Conclusion 

the factor 2 predicted by the theory could be related to 

the fact that the autocorrelation in time is substantially Let us summarise the main messages of this study. First, 

larger than the correlation in space (compare Figs. 3 and the statistics of the turnout rates in French elections is 

5), meaning that the idiosyncratic field fi{R,t) has sub- found to be surprisingly stable over time, once the aver- 

stantially longer temporal correlations than spatial corre- age turnout rate is factored in. The size dependence of the 

lations. If the correlation time of the noise 77 is of several turnout rate variance may have suggested some intra city 

months, the contribution of the decorrelation of fi{R, t) to 'herding' effect, but we believe that the data is more con- 

Cr{t) probably interferes with the contribution of the cul- sistent with small towns having a larger dispersion in local. 



tural field (p ^-nd effectively increases the empirical slope, idiosyncratic biases. A convincing argument for why this 
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should be so is however lacking. An explanation could be the idiosyncratic field, are clearly needed at this stage. We 
the systematic difference in the socio-cultural background hope that our work will motivate more empirical studies 
of the population of large cities compared to that of small to refine and calibrate the model proposed here, 
cities. 
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